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Abstract 

Although the pathophysiology of facioscapulohumeral dystrophy (FSHD) has been controversial over the last 
decades, progress in recent years has led to a model that incorporates these decades of findings and is gaining 
general acceptance in the FSHD research community. Here we review how the contributions from many labs over 
many years led to an understanding of a fundamentally new mechanism of human disease. FSHD is caused by 
inefficient repeat-mediated epigenetic repression of the D4Z4 macrosatellite repeat array on chromosome 4, 
resulting in the variegated expression of the DUX4 retrogene, encoding a double-homeobox transcription factor, 
in skeletal muscle. Normally expressed in the testis and epigenetically repressed in somatic tissues, DUX4 expression 
in skeletal muscle induces expression of many germline, stem cell, and other genes that might account for the 
pathophysiology of FSHD. Although some disagreements regarding the details of mechanisms remain in the field, 
the coalescing agreement on a central model of pathophysiology represents a pivot-point in FSHD research, 
transitioning the field from discovery-oriented studies to translational studies aimed at developing therapies based 
on a sound model of disease pathophysiology. 
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Introduction 

The mutation causing the most common form of facios- 
capulohumeral muscular dystrophy (FSHD) was identi- 
fied over 20 years ago, yet for many of the ensuing years 
there was little or no consensus in the scientific commu- 
nity regarding the molecular pathophysiology of the dis- 
ease. Unlike other disease-causing mutations that disrupt 
the normal function of an identified gene, FSHD is caused 
by the loss of a subset of repeat units in the D4Z4 macro- 
satellite repeat array on chromosome 4 that does not 
disrupt the structure of any gene. This led to many differ- 
ent hypotheses regarding the mechanism of FSHD patho- 
physiology, and many disagreements among the researchers 
in this field. The absence of a generally accepted model 
for FSHD pathophysiology made it difficult to get support 
for research in FSHD and limited the interest in therapeutic 
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development. Over the last several years, several key ad- 
vances in FSHD research, many made possible by the 
steady improvement in technology, have identified the 
molecular and genetic causes of FSHD and clarified 
the mechanisms of pathophysiology, leading to the dis- 
covery that FSHD is a disease of inefficient repeat-mediated 
epigenetic repression of the DUX4 retrogene embedded in 
the D4Z4 repeat units. Here we will review the line of ex- 
perimental evidence that gradually led to this model of 
FSHD pathophysiology. As this model gains general 
acceptance in the field, greater attention and resources 
can now be devoted to therapeutic development, indi- 
cating that we have come to a pivotal moment in FSHD 
research. 

Review 

Clinical features of FSHD 

One of the earliest descriptions of FSHD was published in 
1885 by the French neurologists Landouzy and Dejerine 
[1]. The clinical syndrome was further characterized in a 
subsequent study of large Mormon families in Utah by 
Tyler and Stephens [2]. The most comprehensive study, 
describing the salient features of FSHD, was written by 
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George Padberg in 1982 as a dissertation thesis (http:// 
hdl.handle.net/1887/25818). In the vast majority of cases, 
FSHD is an autosomal dominant disease with a high fre- 
quency of de novo mutations [3]. However, about 5% of 
patients with clinical FSHD, termed FSHD2, are genetic- 
ally distinct with a more complex digenic inheritance pat- 
tern (see section on FSHD2) [4]. To date, based on a 
relatively small study, FSHDl and 2 appear to be clinically 
indistinguishable; however, larger studies are needed to 
confirm this observation [5]. 

Current estimates of the prevalence of FSHD range 
from 1:14,000 to 1:20,000 [6-9]. The age at disease onset 
ranges from infancy to middle age with the majority be- 
coming symptomatic in the second and third decade of 
life. Early estimates of disease penetrance were 95% by 
the age of 20 years but recent studies suggest that the 
penetrance might be lower at this age [10,11]. In most 
instances, FSHD presents with a distinct, regional, often 
asymmetric muscle weakness starting rostrally in the face 
and shoulder muscles and progressing caudally over time 
to involve the trunk and leg muscles [3]. The early in- 
volvement of the periscapular muscles result in the dis- 
tinctive profile of the shoulders of patients with FSHD 
with scapular winging, straight clavicles, and rounded 
shoulders [3]. The presence of a combination of scapular 
winging and facial weakness without other signs of muscle 
involvement, and an autosomal dominant family history, 
makes the diagnosis of FSHD all but certain. As the disease 
progresses, the muscles of the trunk and lower extremities 
become involved. Unlike other muscular dystrophies the 
extraocular muscles, pharyngeal muscles, and the cardiac 
muscle are spared [3]. Although rare, two distinguishing 
extramuscular manifestations in FSHD are the presence of 
a progressive high frequency hearing loss and a retinal ex- 
udative retinopathy (Coats disease), which if untreated can 
lead to blindness [12-15]. Symptomatic hearing loss and 
retinal vascular disease occurs almost exclusively in FSHD 
individuals with only one to three residual D4Z4 repeats 
(see nex section) and it is estimated that only about 1% of 
patients with FSHD develop Coats disease [13]. 

The spectrum of disease severity in FSHD varies widely 
with approximately 20% of genetically affected individuals 
remaining asymptomatic. Disease progression in general is 
relatively slow with estimates of an average loss of 5% of 
total strength per year as measured by manual muscle test- 
ing or quantitative myometry [13,16]. As a group, women 
are less severely affected and tend to have a later age at dis- 
ease onset [13]. About 20% of patients with FSHD above 
the age of 50 years become wheelchair dependent and 
patients with the smallest residual repeat arrays are most 
at risk of wheelchair dependence [13,16]. In general life 
expectancy is not reduced in FSHD although about 1% of 
individuals can develop severe restrictive lung disease re- 
quiring the use of a ventilator [17]. 



In general, FSHD muscle does not show distinguishing, 
disease-specific characteristics on histopathologic examin- 
ation. Unlike most dystrophies associated with structural 
protein defects, in FSHD the early myopathic changes are 
mild with relatively little fibrosis, muscle fiber hypertrophy, 
or central nucleation. Up to one-third of FSHD muscle bi- 
opsies show variable amounts of endomysial inflammation, 
often surrounding small endomysial blood vessels [18,19]. 
The inflammatory infiltrates are predominantly CD8+ with 
more prominent CD4+ T cells in the perivascular infil- 
trates [18,20]. Unlike polymyositis and inclusion body 
myositis, there does not appear to be a cytotoxic T-cell me- 
diated muscle fiber injury as no invasion of non-necrotic 
fibers is observed in FSHD. Other dystrophies, such as 
Duchenne dystrophy and dysferlinopathies, are associ- 
ated with inflammatory infiltrates but the predilection of 
the inflammatory infiltrates for the perivascular regions is 
unique to FSHD. This pathologic finding coupled with the 
occurrence of a retinal vasculopathy with an inflammatory 
component in FSHD and the demonstration of dysregu- 
lated vascular genes in FSHD muscle has raised the possi- 
bility that the T-cell mediated response may be directed 
against bloodvessels [18,21]. 

Loss of a subset of D4Z4 macorsatellite repeats causes 
FSHD 

FSHD was one of the first Mendelian disorders mapped 
by the use of microsatellite markers to the distal end of 
the long arm of chromosome 4 in a cohort of Dutch 
FSHD families in 1990 [22]. This mapping to 4q35 was 
soon confirmed by others [23]. Further studies showed 
that in most famflies FSHD was genetically linked to a 
size reduction of a polymorphic EcoRl fragment below a 
threshold of 38 kb. Both in FSHD families and in spor- 
adic cases, FSHD was consistently linked to this £coRI 
fragment of variably reduced size [24,25] . Soon thereafter, 
the nature of this enigmatic observation was resolved by 
showing that the £coRI fragment contained a tandem 
array of a variable number of 3.3 kb units, named D4Z4 
units, in a head-to-tail orientation [26]. The DNA content 
of the D4Z4 unit is complex and contains sequences typic- 
ally found in constitutive heterochromatin, but each unit 
also contains a copy of a homeobox sequence that was 
later characterized as the DUX4 retrogene [27-30]. 

D4Z4-like sequences are distributed over the genome 
and can be found in pericentromeric and subtelomeric 
domains, but in the subtelomeres of chromosomes 4q 
and lOq the D4Z4 units are arranged in perfect tandem 
arrays [31,32]. Within the control population, the D4Z4 
repeat array on chromosome 4 varies between 11 and 100 
units, while the array on chromosome 10 can vary from 
one to 100 units [33]. Most patients with FSHDl have one 
array of one to 10 units on chromosome 4 [34]; some pa- 
tients with clinical features resembling FSHD have been 
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reported carrying arrays of 11 units [35]. In approximately 
half of the de novo families, the mutation arose by a mi- 
totic contraction of the D4Z4 repeat array leading to the 
presence of somatic mosaicism for the FSHDl mutation 
in the proband, or in one of the clinically unaffected 
parents [36]. 

Units within an array are highly homologous, however, 
sequence polymorphisms between the units derived from 
chromosome 4 or 10 allow for correct chromosomal as- 
signment. Generally, chromosome 10-derived units are 
uniquely sensitive to digestion by the endonuclease Blnl 
while chromosome 4-derived units are sensitive to diges- 
tion by Xapl [37,38]. This differential sensitivity to these 
endonucleases, and other polymorphisms that were 
discovered later in and around the D4Z4 repeat, facil- 
itated studies into the dynamic nature of these repeats on 
both chromosomes. Based on several population studies a 
model emerged in which at least four subtelomeric ex- 
changes between chromosomes 4 and 10 occurred during 
recent hominoid evolution leading to a complex picture 
ranging from individuals having the expected genetic con- 
figuration of chromosome 4-type repeats on chromo- 
some 4 and 10-type repeats on chromosome 10, to 
individuals having translocated repeat arrays on either 
chromosome, to individuals showing evidence for car- 
rying repeat arrays that are mixtures of 4-type and 10- 
type repeat units [33,39-41]. 

Despite the apparently dynamic behavior of these sub- 
telomeric repeat arrays, D4Z4 repeat contractions on 
chromosome 4, but not on chromosome 10, were con- 
sistently identified in FSHD patients. This suggested that 
unique genetic features on chromosome 4 were neces- 
sary to cause FSHD, or alternatively, that sequences on 
chromosome 10 were protecting from pathogenicity. More- 
over, at least one D4Z4 unit was necessary to cause FSHD 
since monosomy of the distal end of chromosome 4, in- 
cluding the D4Z4 repeat array, did not result in FSHD 
[42]. Because of the high homology of the repeats on 
chromosomes 4 and 10, and the 40 kb sequences proximal 
to the repeat, different models were proposed to explain 
the apparent chromosome 4 specificity. One model pre- 
dicted that the D4Z4 repeat array acts as a barrier between 
the heterochromatic telomere, at 40 to 60 kb distance of 
the repeat, and proximal sequences. Upon repeat contrac- 
tion, it was proposed that this barrier function becomes 
incomplete leading the spreading of heterochromatin in 
the centromeric direction and subsequent silencing of 
proximally located genes that are unique to chromo- 
some 4. A second model predicted that the supposedly 
heterochromatic nature of D4Z4 itself was responsible for 
the regulation of cis-located genes by a looping or spread- 
ing mechanism and that upon contraction this regulation 
was impaired leading to the ectopic expression of proxim- 
ally located genes on chromosome 4. 



These models initiated the search for closely mapped 
genes and resulted in the identification of three genes 
and one pseudogene within an interval of 120 kb from 
the repeat. These included FSHD candidate region gene 1 
(FRGl) at 120 kb distance from the repeat, the TUBB4q 
pseudogene at 80 kb distance, the DUX4C gene originat- 
ing from an inverted and incomplete copy of the D4Z4 
unit located at the boundary of the homologous regions 
on chromosomes 4 and 10, and FRG2 at 35 kb distance 
and also present on chromosome 10 [43-46]. 

Many studies addressed the deregulation of proximally 
located genes with inconclusive results. An initial study 
reported the repeat length and distance dependent up- 
regulation of FRGl and FRG2 in muscle of FSHD patients 
[47]. However, follow-up studies by different groups and 
different techniques failed to confirm the upregulation of 
FRGl in FSHD muscle [21,48-50]. Also, chromatin studies 
of the D4Z4 repeat and immediately proximal sequences 
did not find evidence for a spreading of heterochromatin 
mechanism from the D4Z4 repeat, although some evi- 
dence for a role of D4Z4 in controlling FRGl expression 
by a looping mechanism was reported [48,51,52]. 

Although FRG2 was consistently reported to be over- 
expressed from chromosomes 4 and 10 in FSHD muscle 
[41,44,48], its involvement in FSHD was challenged by the 
identification of an FSHDl family in which the deletion not 
only involved the D4Z4 repeat array, but also eliminated 
FRG2 and DUX4C from the disease allele [44,47,50,53]. 
This suggested that, although the upregulation of FRG2 in 
FSHD remains poorly explained, it does not likely play an 
important role in FSHD pathology. 

An observation that challenged both models was the 
discovery of two major chromosome 4 haplotypes, called 
4A and 4B [54]. These two major variants of the distal 
end of chromosome 4 vary by small polymorphisms 
proximal and distal to the repeat, as well as in the repeat 
array itself. An important difference between the 4A and 
4B variants is that the 4A variant contains a distal se- 
quence called pLAM immediately followed by a beta sat- 
ellite repeat, which are both absent on 4B chromosomes. 
Using probes that recognize sequences distal to the re- 
peat, it was shown that both variants are almost equally 
prevalent in the population, but that only D4Z4 repeat 
contractions on 4A chromosomes are associated with 
FSHD, while contractions on 4B chromosomes do not 
cause disease [55,56]. This observation was difficult to 
unify with either of the two prevailing hypotheses explain- 
ing the chromosome 4 specificity. 

Coding and non-coding RNAs are produced from the 
D4Z4 region 

As noted above, each of the D4Z4 units on chromosome 4 
and 10 had been found to contain a copy of the DUX4 ret- 
rogene. Because the DUX4 open reading frame (ORF) was 
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conserved [27,57], there was debate regarding whether this 
was the result of positive selection for the function of the 
protein, particularly because DUX4 mRNA had not been 
detected in any primate tissues. The emerging model of 
RNA-mediated epigenetic repression of repetitive elements 
in model organisms also led to the hypothesis that D4Z4 
RNA transcription might be associated with the repeat 
heterochromatin, however, RT-PCR and RNA polymerase 
II chromatin immunoprecipitation found little-or-no tran- 
scription of this region in cells from control or FSHD 
individuals [58]. Subsequently, Dixit et al, [30] used 
transfections of the genomic locus to identify DUX4 mRNA 
transcripts that were both spliced and poly-adenylated at a 
site distal to the last D4Z4 unit, in a region referred to as 
pLAM, and RT-PCR was used to identify this mRNA in 
FSHD muscle cells (Figure 1). 

The production of a DUX4 mRNA with a poly- 
adenylation sequence in the pLAM region was con- 
firmed in a study that also showed evidence for long 
transcripts through the D4Z4 repeats in both the sense 
and anti-sense directions [29]. Some of the sense transcripts 
from internal repeats were identified that had spliced into 
the pLAM exon containing the poly-adenylation se- 
quence, indicating long transcripts spanning multiple 
D4Z4 units. In addition, small RNA fragments consistent 
with siRNAs or miRNAs were shown to be generated 
from these long transcripts, suggesting a possible role of 
RNA transcripts in epigenetic silencing of the locus as had 



been previously suggested, or possibly with other biological 
effects that can be mediated by small si or miRNAs. 

The Snider et al study [29] documented bidirectional 
RNA transcripts extending through multiple D4Z4 units 
and mapped the transcription start site of the DUX4 
containing transcript to a region immediately upstream 
of the DUX4 ORF. Block et al [59] showed that a region 
of D4Z4 distal to the DUX4 retrogene initiated tran- 
scripts in the anti-sense direction. Together these results 
suggest that transcripts might originate in each unit ei- 
ther near the promoter region of the DUX4 ORF or at a 
distal region and extend in each direction through mul- 
tiple repeats. A more recent study focused on the long 
sense transcript through the D4Z4 units and called it a 
long non-coding RNA named DBE-T [60]. The sequence 
of the D4Z4 has many extended ORFs but the non- 
coding appellation might have been chosen because of 
the demonstration that the transcript was chromatin as- 
sociated. The origin of this DBE-T transcript was mapped 
to the region centromeric to the D4Z4 repeats rather than 
to the promoters within D4Z4 described previously 
[29,30,59]. DBE-T was shown to be produced from FSHD 
and not control alleles, to recruit the ASHIL member of 
the trithorax group to the D4Z4 region, and to de-repress 
transcription of the DUX4 mRNA. Although the Cabianca 
et al. study [60] implicated a polycomb-mediated silen- 
cing of the non-contracted D4Z4 array, consistent with 
other prior studies, it differs from other models in that it 



■ im ■ ■ 



Chromosome 4 



D4Z4- Repeats 



PAS 



-TEL 



->9.8 kb 



-^4.4 kb 



->2.4 kb 



Figure 1 Schematic of D4Z4 locus on chiromosome 4. The D4Z4 locus is in tlie sub-telomeric region of 4q. Tlie figure sliows a tliree repeat 
D4Z4 array. CEN indicates tlie centromeric end and TEL indicates tlie telomeric end. Tlie DUX4 gene is sliown as a gray rectangle with exon 1 
and exon 2 in each repeat and exon 3 in the pLAM region telomeric to the last partial repeat (numbered 1, 2, and 3). PAS indicates the polyadenylation 
site on the permissive 4qA allele that is not present on the non-permissive 4qB allele or on chromosome 10. The arrowed lines represent: Blue, DBE-T 
transcripts (2.4, 4.4, and 9.8 kb) found in FSHD cells and reported to de-repress DUX4 expression [60]; Black and red, transcripts in the sense 
and anti-sense direction were detected in both FSHD and control cells [29] and might originate from the mapped sense promoters 
(black) [29,30] and anti-sense promoters (red) [59] with dashed lines indicating areas that might be degraded or produce si-like small 
RNAs. NDE, non-deleted element identified as the transcription start site for the DBE-T transcripts [60]. 
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identified a transcript originating in the region centro- 
meric to the D4Z4 array that was selectively expressed in 
FSHD cells and necessary for de-repression of the locus, 
rather than the prior study identifying transcripts originat- 
ing in the D4Z4 units in both control and FSHD cells that 
might contribute to RNA silencing of the locus [29] 
(see Figure 1). Further studies are necessary to deter- 
mine which of these two distinct models is correct. 

Variegated expression of DUX4 in FSHD muscle 

In contrast to some of the uncertainties regarding the 
molecular details of RNA-mediated regulation of D4Z4 
transcription, the expression of a DUX4 mRNA as a 
cause of FSHD has gained further support. While the 
Dixit et al study [30] identified a poly-adenylated DUX4 
mRNA only in FSHD muscle, the Snider et al study [29] 
identified DUX4 mRNA and other RNA transcripts in 
skeletal muscle cells from both control and FSHD af- 
fected individuals, albeit at higher levels in FSHD sam- 
ples. In addition this study identified several different 
splice forms of the DUX4 mRNA, including one that 
would produce the full-length predicted ORF (DUX4-fl) 
and one that would produce a truncated protein that 
lacked the carboxy- terminal region (DUX4-s). A subse- 
quent study [61] designed to measure the relative abun- 
dance of the DUX4-fl and DUX4-S mRNAs in skeletal 
muscle cells and muscle biopsies showed that the control 
biopsies expressed very low amounts of the DUX4-S tran- 
script encoding the truncated protein, whereas the FSHD 
samples expressed very low amounts of the DUX4-fl and 
sometimes the DUX4-S as well, indicating that the DUX4-fl 
mRNA was correlated with FSHD and might be a cause 
for FSHD. 

The transcripts for both DUX4-fl and DUX4-S, how- 
ever, were at extremely low abundance, requiring high 
cycle PGR for reliable detection. It was unclear how such 
a low abundance mRNA, present at less than one copy 
per nucleus, could cause the disease. This was addressed 
using small-pool PGR and immunodetection of the DUX4 
protein to show that a small subset of nuclei were produ- 
cing relatively abundant DUX4-fl mRNA and protein with 
associated nuclear changes consistent with a pre-apoptotic 
effect [61]. Therefore, the low levels of DUX4-fl mRNA in 
FSHD muscle represented relatively high expression from 
a small subset of nuclei consistent with variegated expres- 
sion patterns previously associated with repeat-mediated 
epigenetic repression [62]. In cultured mono-nuclear 
replicating myoblasts, isolated cells were found to ex- 
press DUX4, indicating that the repression of DUX4 
was occasionally lost and resulted in a burst of DUX4 
expression with either subsequent silencing or the death 
of the cell. Myotubes have multiple nuclei in the same cell 
and expression of an mRNA from one nucleus will distrib- 
ute the mRNA and protein to adjacent nuclei [63] and. 



indeed, the myotubes with a nucleus expressing DUX4 
show the distribution of the protein to a regional domain 
of nuclei [64]. 

Genetic confirmation that DUX4 mRNA causes FSHD 

The previously mentioned observation that D4Z4 repeat 
contractions on 4A chromosomes were associated with 
FSHD, while contractions on 4B chromosomes did not 
cause disease [55,56] led to the genetic confirmation that 
the DUX4 mRNA was a necessary cause of FSHD [65]. 
First, a moderately polymorphic imperfect dinucleotide 
repeat, or simple sequence length polymorphism (SSLP), 
was discovered immediately proximal to the repeat, lead- 
ing to the discovery that most patients had a contraction 
on one of the most prevalent 4A haplotypes with an 
SSLP length of 161 base pairs (4A161) [40]. Gontractions 
on other 4A haplotypes like 4A159 and 4A168 were also 
associated with FSHD, with the notable exception that 
contractions on 4A166 chromosomes were not associ- 
ated with FSHD [40,41]. Subsequent sequencing of the 
proximal and distal ends of the D4Z4 repeat array and 
its flanking sequences on these informative haplotypes 
allowed the identification of a series of SNPs and other 
polymorphisms that were consistently found on FSHD- 
permissive haplotypes, but not on non-permissive haplo- 
types [41]. 

The most striking difference between permissive and 
non-permissive haplotypes was the polymorphic nature 
of the DUX4 poly-adenylation sequence previously iden- 
tified [29,30]. While permissive haplotypes consistently 
contained a poly-adenylation sequence for the DUX4 
mRNA, non-permissive haplotypes carried SNPs that dis- 
rupted this poly-adenylation sequence. This demonstrated 
that polymorphisms in the DUX4 poly-adenylation se- 
quence accounted for the restricted haplotype specificity 
of FSHD. 

Finally, detailed genetic analysis of FSHDl families 
with D4Z4 repeat array contractions that consisted of 
mixtures of 4- type and 10-type units showed that all 
pathogenic repeat arrays ended with a classical FSHD- 
permissive 4A sequence signature containing the DUX4 
poly-adenylation sequence, even when translocated to 
chromosome 10 [65]. Together, these studies showed 
that the DUX4 mRNA poly-adenylation sequence was 
necessary for FSHD, providing genetic proof that the 
DUX4 mRNA was necessary for FSHD. 

FSHD2: Genetic confirmation of decreased epigenetic 
repression as the cause of FSHD 

The first evidence for an epigenetic disease mechanism in 
FSHD came from D4Z4 GpG methylation studies [66]. 
Making use of the diagnostic pl3E-ll probe and a com- 
bination of methylation-insensitive and methylation- 
sensitive endonucleases, it was shown that the contracted 
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repeat array was hypomethylated compared to normal- 
sized arrays [66,67]. However, FSHD2 individuals had a 
classical FSHD phenotype and did not have a D4Z4 repeat 
array contraction, but did show a strong reduction of 
D4Z4 methylation, suggesting that the common feature of 
FSHDl and FSHD2 was decreased epigenetic repression 
of D4Z4 [66,68]. Methylation analysis of the D4Z4 repeat 
array in FSHD2 patients by bisulfite sequencing confirmed 
that the D4Z4 repeat is hypomethylated but that this hy- 
pomethylation is not uniformly observed across the entire 
unit [69]. 

Follow-up studies in a larger series of FSHD2 individuals 
showed very low D4Z4 methylation levels at the repeats of 
all four chromosomes but each affected individual had 
at least one FSHD-permissive chromosome 4 haplotype. 
Also other changes in epigenetic markers of the D4Z4 re- 
peat, such as the binding of heterochromatin protein 1 
gamma, the cohesin complex and some histone modifica- 
tions, were similarly affected [70,71]. 

D4Z4 methylation analysis in family members of 
FSHD2 individuals showed that D4Z4 hypomethylation 
segregated as a dominant trait independent from disease 
presentation in some of these families. Digenic inherit- 
ance of the D4Z4 hypomethylation trait with an FSHD 
permissive allele was necessary for FSHD2, whereas indi- 
viduals with hypomethylated non-permissive chromo- 
somes 4 alleles remained unaffected [4]. 

Subsequent whole exome sequencing in selected FSHD2 
families identified mutations in the Structural Main- 
tenance of Chromosomes Hinge Domain Containing 1 
(SMCHDl) gene in six out of seven cases [4], and analysis 
of a larger cohort confirmed that SMCHDl mutations ac- 
count for approximately 85% of FSHD2 families. In mice, 
Smchdl was shown to be involved in gene repression by 
the establishment and/or maintenance of CpG methyla- 
tion at loci that are monoallelically expressed, such as a 
restricted set of genes on the inactive X chromosome in 
females and the protocadherin gene cluster [72-75]. 

Analogous to the studies in mice, in muscle cell cul- 
tures from FSHD2 individuals and controls it was shown 
that SMCHDl binds to the D4Z4 repeat and that there 
is reduced SMCHDl binding in SMCHDl mutation car- 
riers. Knock down and exon skip experiments confirmed 
that in control muscle cells containing an FSHD-permissive 
chromosome the reduction of SMCHDl levels led to the 
derepression of DLIX4, providing a mechanistic link be- 
tween mutations in SMCHDl on chromosome 18 and 
DUX4 reactivation from the D4Z4 repeat on chromo- 
some 4 [4] . Thus, FSHD2 is caused by the digenic inher- 
itance of a normal-sized D4Z4 repeat array on a DUX4 
poly-adenylation sequence containing chromosome 4 and 
a heterozygous SMCHDl mutation on chromosome 18. 

Genetic variation in SMCHDl not only explains the 
majority of FSHD2 individuals, but contributes to the 



variability in onset and progression of the disease in some 
FSHDl families. Examination of three FSHDl families 
with a borderline D4Z4 repeat of 9 units and an unusually 
severely affected proband in each family, showed that the 
more severely affected proband had both the D4Z4 con- 
traction causing FSHDl and a mutation in SMCHDl 
causing FSHD2 [76], indicating that SMCHDl mutations 
modify the penetrance or severity of FSHDl mutations. 
Indeed, knockdown of SMCHDl in FSHDl muscle cells 
led to an increase in DUX4 mRNA and its targets, indi- 
cating that both mutations act synergistically, and that 
FSHDl and FSHD2 result from chromatin relaxation and 
derepression of DUX4, It will be interesting to explore to 
what extent natural genetic variation in the SMCHDl 
locus contributes to clinical variability in a larger cohort 
of FSHDl families. 

A developmental model for FSHD 

Increasing evidence indicated that FSHD was caused by 
the decreased epigenetic repression of the D4Z4 array and 
the variegated expression of the DUX4 mRNA in FSHD 
muscle, but neither the normal biological role of DUX4 
nor the consequences of its expression in skeletal muscle 
were known. The retro transposition of the DUXC gene 
created the DUX4 retrogene at the root of the primate 
lineage [77]. Primates have maintained DUX4 and lost 
DUXC, suggesting that DUX4 might have replaced the 
function of DUXC and might have also had a selective ad- 
vantage for primates. 

To become a retrogene, the parental gene must be 
expressed in the germline, and indeed DUXC is expressed 
in the canine testis (SJT, unpublished data) and the DUX4 
retrogene is expressed in the human testis [61]. Immuno- 
detection showed DUX4 in luminal cells of the testis, con- 
sistent with spermatogonia and primary spermatocytes. 
Although additional studies are necessary to confirm the 
cell type expressing DUX4 in the testis, these results are 
consistent with expression of both DUXC and DUX4 in 
germline cells. 

In contrast to skeletal muscle, DUX4 mRNA tran- 
scripts from both FSHD-permissive and non-permissive 
alleles are identified in testes. Testes DUX4 transcripts 
from both chromosome 4 and chromosome 10 splice 
exon 2 to novel distal exons and use a poly-adenylation 
sequence that is approximately six kb telomeric to the 
exon 3 poly-adenylation sequence used in FSHD muscle 
[61]. No skeletal muscle transcripts, either in control or 
FSHD tissues, have been identified that use these distal 
exons and poly-adenylation sequence and they appear to 
be testis specific, permitting stable DUX4 mRNA expres- 
sion from both FSHD-permissive and non-permissive al- 
leles in the testis. 

Therefore, DUX4 appears to be expressed in the germ- 
line and epigenetically repressed in somatic tissues, likely 
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through a repeat-mediated epigenetic silencing pathway. 
FSHD is caused by the inefficient epigenetic repression in 
skeletal muscle and the low-level variegated expression of 
the DUX4 mRNA and protein. This developmental model 
is supported by the demonstration that iPS cells from 
either FSHD or control individuals express the DUX4 
mRNA, whereas DUX4 expression is suppressed in em- 
bryoid bodies derived from control cells and persists in 
embryoid bodies derived from FSHD cells [61]. 

Consequences of DUX4 expression in skeletal muscle 

DUX4 is a member of the double-homeobox family of 
transcription factors that arose in placental mammals. 
The role of DUX4 and this family of transcription factors 
in developmental biology remains unknown, but the con- 
sequences of the expression of DUX4 in skeletal muscle 
has been described. Earlier studies showed that expression 
of DUX4 in muscle cells induces apoptosis [78] that is 
dependent on an intact p53 [79] . An expression array and 
ChlP-seq analysis of human skeletal muscle cells trans- 
duced with DUX4 showed that DUX4 bound to a consen- 
sus sequence containing two homeodomain motifs and 
activated transcription of several hundred genes [80]. 
Consistent with a role in germline biology, DUX4 acti- 
vated expression of a large number of genes normally 
expressed in the testis and in stem cell biology, as well as 
genes associated with RNA and protein processing, such 
as splicing factors and ubiquitin ligases. 

Many expression array studies and some protein ex- 
pression studies have been performed on control and 
FSHD muscle biopsies, but a strong molecular signature 
has been elusive. Some studies have focused on elevation 
of specific genes, such as CRYM or PITXl [30,81], or as- 
pects of the program of muscle differentiation [82], or a 
molecular signature based on multiple genes without a 
clear regulatory association [83]. The recognition that 
DUX4 is expressed in a variegated pattern in cultured 
FSHD muscle cells might explain some of the difficulty 
identifying a clear gene expression signature when test- 
ing either cultures of FSHD cells or biopsies from FSHD 
muscle where DUX4 might be expressed in only a mi- 
nority of the nuclei. Indeed, when expression array data 
are analyzed based on a priori knowledge of genes in- 
duced by DUX4-transduction in skeletal muscle cells, 
there is a significant elevation of these genes in FSHD 
muscle ([83] and SJT unpublished data); and a focused 
analysis shows that DUX4 and DUX4 target genes are 
expressed in fetal FSHD muscles [84]. 

Approximately one-third of DUX4 binding sites are in 
repetitive elements [85,86]. DUX4 binds the LTR element 
of the MaLR and HERVL family of retrotransposons and 
activates transcription of these endogenous repetitive 
elements. Interestingly, some of the genes that DUX4 
regulates are initiated from the LTR elements of these 



retrotransposons (Figure 2). These retrotransposon sub- 
families were last mobilized at the root of the primate 
lineage, at roughly the same time that DUX4 retrotran- 
posed from DUXC, These new primate-specific binding 
sites created a primate-specific gene network for DUX4, 
in addition to the network that evolved for DUXC prior to 
the last mobilization of these repeats in primates. It is 
important, therefore, to recognize that expression of 
DUX4 in non-primate cells will not recapitulate the gene 
network activated in human cells, consistent with the 
observation that there is only a partial overlap of genes 
regulated by DUX4 in human and mouse cells [86,87]. 
Therefore, murine models of the transcriptional activity or 
pathophysiology of human DUX4 expression need to be 
carefully designed and interpreted. 

Candidate mechanisms of disease 

The consequences of DUX4 expression in skeletal muscle 
cells suggest several viable pathophysiological mechanisms 
for FSHD. Skeletal muscle cell apoptosis is perhaps the 
most dramatic consequence of DUX4 expression [78]. 
The apoptosis appears p53-dependent [79] but the specific 
molecular pathways remain to be defined. As noted above, 
DUX4 activates a stem-cell-like transcriptional program 
in skeletal muscle cells and it is possible that the incom- 
patibility of these two differentiation programs is sufficient 
to induce an apoptotic response, similar to the older ob- 
servations that promoting cell cycle in differentiated 
muscle cells leads to apoptosis [88]. Apoptotic loss of 
muscle nuclei that accumulates over time might therefore 
contribute to FSHD pathophysiology. In addition, DUX4 
alters expression of genes involved in RNA splicing and 
processing, ubiquitin ligases implicated in atrophy, and 
many non-coding RNAs, any or all of which might con- 
tribute to FSHD pathophysiology. 



annotated 
transcript 



DUX4-induced 
transcript 




Figure 2 Schematic of novel transcription start sites regulated 
by DUX4. Upper panel shows the standard transcriptional start 
site for HEYl and its intron-exon structure in green. The lower 
panel shows the transcript induced by DUX4 that originates in a 
retrotransposon LTR (blue) and splices into novel exons (uncolored) 
and the annotated HEYl exons (green) to produce a predicted protein 
similar to HEYl. 
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In addition to damaging skeletal muscle, the expres- 
sion of DUX4 might also inhibit normal muscle regener- 
ation. DEFB103 is strongly activated by DUX4 in skeletal 
muscle cells [80]. DEFB103 is a member of the defensin 
family of small peptides and has been shown to inhibit the 
innate immune response and also to be an antagonist lig- 
and for the CXCR4 receptor [89-91]. CXCR4 is expressed 
in activated muscle satellite cells and myoblasts. CXCR4 
signaling has been shown to be necessary for myoblast 
migration and muscle cell differentiation [92,93], and ex- 
ogenous DEFB103 inhibits muscle cell differentiation and 
the innate immune response in skeletal muscle cells [80]. 
Therefore, the induction of DEFB103 by DUX4 might pre- 
vent normal muscle regeneration and/or modulate the re- 
sponse of skeletal muscle to an immune stimulus. 

A plausible model for a primary immune response as a 
component of FSHD pathophysiology also emerges from 
the identification of DUX4 regulated genes. Proteins 
expressed only in the germline can escape surveillance 
by the immune system since the germline is an immune- 
privileged site similar to the brain, and induce an im- 
mune response when mis-expressed in somatic tissues. 
This has been well studied in cancer biology where the 
mis-expression of germline genes induces an anti-cancer 
immune response. This group of genes is collectively re- 
ferred to as cancer-testis-antigens. Therefore, the DUX4- 
induced mis-expression of these genes in FSHD muscle 
would be expected to induce an immune response. 
Whether this is the basis of the immune cell infiltrate as- 
sociated with FSHD histology remains to be determined. 

Interestingly, imaging studies in FSHD also support the 
possibility that an immune process might have a primary 
role in its pathophysiology. Recent studies have revealed 
an association between MRI characteristics consistent 
with an inflammatory process and the presence of infiltrat- 
ing T-cells in FSHD muscle [20]. Then a subsequent study 
identified DUX4 regulated genes as mis-expressed in 
some FSHD muscles with inflammatory MRI charac- 
teristics [94]. Together with other similar studies that indi- 
cate edematous/inflammatory MRI characteristics progress 
to fatty infiltration [95-97], these studies suggest the possi- 
bility that DUX4 target gene expression and T-cell infil- 
trates might correlate with inflammatory characteristics 
on MRI. Further studies correlating the molecular charac- 
teristics of FSHD muscle with early MRI changes will be 
revealing in this regard. 

Another broad category of DUX4-induced transcripts 
that might contribute to FSHD pathophysiology are re- 
petitive elements, non-coding RNAs, and novel first exons 
originating in remnants of retrotransposon sequences 
[85]. DUX4 massively induces the pericentromeric satellite 
HSATII sequences that are also over- expressed in many 
cancers [98] and activates expression of endogenous ret- 
rotransposons and transcripts driven by isolated LTR 



elements that are the remnants of retrotransposons [85]. 
These RNAs and novel protein encoding transcripts could 
have biological activity related to FSHD pathophsysiology 
and need to be studied further. 

An important observation was that some unaffected 
relatives of FSHD individuals express DUX4 mRNA in 
their skeletal muscle cells [99]. Although much more 
needs to be done to characterize these individuals, such 
as measuring the DUX4 protein and target genes, this 
observation suggests that other factors are necessary to 
initiate the disease process. Determining whether these 
are modifier loci that protect the muscle cell in a cell- 
autonomous or non-autonomous manner, for example, 
by preventing apoptosis or modulating an immune re- 
sponse, will greatly advance the understanding of FSHD 
pathophysiology. 

Therapeutic opportunities 

The emerging consensus on mechanisms of FSHD patho- 
physiology greatly clarifies the approach to therapeutic in- 
terventions. The opportunities are clear: (1) enhance the 
epigenetic repression of the D4Z4; (2) target the DUX4 
mRNA, including altering splicing or polyadenylation; 
(3) block the activity of the DUX4 protein; or (4) inhibit 
the DUX4-induced process, or processes, that leads to 
pathology. 

Drugs that decrease epigenetic repression are now in 
wide clinical use, such decitabine, an inhibitor of DNA 
methylation, or SAHA, an inhibitor of histone de-acetylases; 
yet pharmaceutical companies have not prioritized the de- 
velopment of drugs that enhance epigenetic repression, 
partly due to the concern that these drugs would also sup- 
press the class of tumor suppressor genes, such as pl6, 
that are epigenetically repressed in many cancers. Al- 
though this was a reasonable concern, the demonstration 
that there are distinct molecular pathways for repeat- 
mediated epigenetic repression that do not target the 
more studied CpG island repression [71-74] suggests that 
this opportunity should be more actively explored. Fur- 
thermore, there is substantial support that the normal 
repeat-mediated repression pathways function through 
small RNA intermediates. The generation of small RNAs 
from the D4Z4 region [29] suggests that these might be 
mediators of the D4Z4 epigenetic repressive pathway and 
might be exploited to silence this region. 

A more conventional approach would be to target the 
DUX4 mRNA and prevent it from making the DUX4 
protein. Oligonucleotide-based therapies have already 
been developed for Duchenne muscular dystrophy [100], 
myotonic dystrophy [101], and other diseases [102,103], 
and many biotechnology companies are focusing on 
these approaches for a multiplicity of diseases. Their ap- 
plication to FSHD is both inevitable and welcome as a 
possibly rapid entry to clinical trials. In addition, some 
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companies have shown the feasibility of screening for 
small molecules that might alter RNA splicing or poly- 
adenylation of a specific transcript, and this is also a 
possible approach for FSHD therapies. 

While there are few drugs that block the activity of 
transcription factors, there are some examples [101] and 
drugs that block kinase cascades that might modify tran- 
scription factor activity are well known. Therefore it is 
not unreasonable to suppose that a drug blocking DUX4 
transcriptional activity might be identified. 

Currently, it is more difficult to imagine a roadmap to 
therapy by targeting a specific pathogenic pathway in- 
duced by DUX4, largely because there is no method to 
prioritize the multiple pathways that might lead to the 
FSHD pathophysiology. Each of these pathways might be 
effectively targeted by candidate therapies, but a human 
trial seems necessary to determine their relative import- 
ance. In contrast to the strong therapeutic justification 
for preventing the expression of DUX4 or its activity as 
a transcription factor, targeting individual pathways 
downstream of DUX4 will be partly an experiment to 
determine their relative role in disease pathogenesis. 

Cell and animal models 

The earlier models for FSHD focused on the possibility 
that one of the closely linked genes might be deregulated 
in FSHD muscle. For some time the Myd mouse was con- 
sidered a strong candidate because the Myd locus mapped 
to an apparently syntenic region on mouse chromosome 8 
[104], but this was soon discarded as a model for FSHD 
with the identification of a mutation in the glycosyltrans- 
ferase Large [105]. 

As mentioned, cellular models were first used to test 
the effects of DUX4 expression in a myogenic context. 
Transient transfection studies of D4Z4 in human cells 
showed that DUX4 is a nuclear protein that, when over- 
expressed, induces cell death [78]. An isogenic myoblast 
expression screen showed that DUX4 expression not only 
causes repression of glutathione redox pathway compo- 
nents and sensitivity to oxidative stress, but also repres- 
sion of MyoD and MyoD targets with impaired myogenic 
differentiation. These effects could be neutralized by over- 
expression of Pax3 or Pax7 [106]. Detailed analysis of the 
transcriptional landscape of D4Z4 further identified an in- 
ternal DUX4 methionine producing a C-terminal DUX4 
polypeptide that is sufficient to inhibit myogenesis in vivo 
at a step between MyoD transcription and the activation 
of MyoD target genes [29]. 

Many other cellular models addressed the consequence 
of DUX4 expression, either by ectopically expressing 
DUX4, or by comparing FSHD muscle cell cultures with 
controls [84,86,107-110]. These cell models are also 
increasingly used for the identification of molecules 
that can suppress DUX4 activity [111,112], and to 



understand the regulation of DUX4 expression and 
activity [59,113-115]. 

Taking advantage of the high frequency of somatic 
mosaicism for the D4Z4 repeat array contraction, iso- 
genic clonal muscle cell lines were established from a 
muscle biopsy of a mosaic FSHD individual that only 
differ by the presence or absence of a D4Z4 repeat array 
contraction [107]. These cell lines were useful to demon- 
strate that the burst-like feature of DUX4 expression is 
an intrinsic feature of the D4Z4 locus and, for the first 
time, showed that immortalized FSHD muscle cells can 
participate in muscle regeneration in vivo in immunode- 
ficient host mice to overcome some of the limitations 
caused by the hominoid-specific features of DUX4. This 
was elegantly supported by a human skeletal muscle 
xenograft model in which human muscle biopsies were 
transplanted into muscle of immunodeficient host mice 
[116]. Muscle biopsies of FSHD individuals were shown 
to fully integrate and to express known human DUX4 
biomarker genes. 

Studies in zebrafish and Xenopus confirmed the tox- 
icity of DUX4 in vivo [29,117] and its overexpression in 
muscle of zebrafish by transposon-mediated transgenesis 
or mouse muscle by adeno-associated viral vectors dem- 
onstrated the DNA binding-dependency of DUX4 toxicity 
[79,117,118]. This toxicity was also dependent on p53, as 
both in vitro and in vivo, DUX4 toxicity was overcome by 
the absence of p53. In an independent study, microinjec- 
tion of small amounts of human full-length DUX4 mRNA 
into fertilized zebrafish eggs caused features reminiscent 
of FSHD such as abnormalities of fin, facial, and trunk 
muscles, and mislocalization of myogenic cells outside so- 
mite boundary. Interestingly, these abnormalities could be 
rescued by introducing DUX4-S that lacks the putative 
DUX4 transactivation domain [118]. Moreover, DUX4- 
induced damage in the mouse muscle seems to be revers- 
ible as RNA interference (RNAi)-based DUX4 therapy 
corrected the myopathic changes caused by DUX4 expres- 
sion in the muscle [119]. 

In a more recent study, a different approach was taken 
by introducing the entire FSHDl locus in the mouse 
genome [87]. One mouse line had a D4Z4 repeat array 
of 2.5 units (D4Z4-2.5), while the second line carried an 
array of 12.5 units together with centromeric DNA that 
included the FRG2, DUX4Q and FRGl loci. While nei- 
ther of these mice produced a clear muscle phenotype, 
these mice do recapitulate many of the D4Z4 regulatory 
mechanisms first reported in humans. D4Z4-12.5 mice 
have a repressed D4Z4 chromatin structure in somatic 
cells and only express DUX4 at consistently detectable 
levels in germline tissues while in D4Z4-2.5 mice, there 
is a partial opening of the chromatin structure similar to 
FSHD patients and derepression of DUX4 leading to the 
molecular hall mark of the disease with few nuclei 
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expressing high amounts of DUX4 protein. A possible ex- 
planation for the lack of muscle pathology could be that 
human DUX4 does not activate the same gene network in 
murine cells, in part due to the retrotransposon-mediated 
spread of its binding sites in primates as discussed above, 
and possibly for other reasons yet to be determined. 

FSHD: A consensus model with minor areas of 
disagreement 

There is increasing consensus on the primary disease 
mechanism of FSHDl and FSHD2. Since the publication 
of the unifying genetic mechanism predicting a promin- 
ent role for the distal copy of the DUX4 retrogene and 
the polymorphic DUX4 polyadenylation sequence [65], 
efforts are largely focusing on the D4Z4 chromatin 
changes and the consequences of somatic DUX4 expres- 
sion [60,69,71,80]. Also, the identification of the FSHD2 
gene, SMCHDl [4,76], causing similar changes in D4Z4 
chromatin structure, and derepression of DUX4, strongly 
support the involvement of polyadenylated DUX4 tran- 
scripts in FSHD pathophysiology. 

However, there are still observations and areas of dis- 
agreement that need to be addressed. Based on the SSLP 
immediately proximal to the repeat, a number of chromo- 
some 4A and 4B subclasses were identified, of which a 
subgroup was shown to be permissive for FSHD (most 
commonly 4A161). All permissive subclasses have the 
polymorphic DUX4 polyadenylation sequence. One not- 
able exception is the 4A166 haplotype. In a Dutch study 
[56], D4Z4 contractions on the 4A166 haplotype did not 
result in FSHD despite the presence of the DUX4 polyade- 
nylation sequence. An Italian study [11], however, identi- 
fied a considerable number of individuals with FSHD 
having a contraction on the 4A166 haplotype, concluding 
that short D4Z4 repeats on this haplotype can be patho- 
genic. However, the frequency of the 4A166 haplotype in 
the population was very different in both studies suggest- 
ing that perhaps a technical issue with the determination 
of the haplotype may be the cause of this apparent dis- 
crepancy. Despite this controversy about the pathogenicity 
of the 4A166 haplotype, both studies showed that the 
great majority of FSHDl patients have a D4Z4 contraction 
on an FSHD-permissive allele defined by the presence of 
the DUX4 polyadenylation sequence. One direction of fu- 
ture research might be to investigate the role of disease 
haplotype-specific sequence variants in addition to the 
DUX4 polyadenylation sequence in the processing of 
DUX4 mRNA to better understand the essential genetic 
features of an FSHD permissive allele. 

Another area of future research is the role of proxim- 
ally located genes such as FRGl, FATl, and DUX4C in 
FSHD pathogenesis. Although the genetic data in FSHD 
families do not provide strong support for a critical role 
for these genes and point to a primary role for the distal 



unit of the D4Z4 repeat, some studies have provided evi- 
dence for deregulation of proximally located genes in 
FSHD [51,52,120,121]. In addition, studies in cell and ani- 
mal models provided evidence that overexpression of these 
proteins can lead to muscle cell pathology. One of the best 
studied mouse models is a transgenic animal overex- 
pressing human FRGl at different levels in skeletal muscle 
[122]. This mouse developed an FRGl dose-dependent 
muscle pathology. Another intriguing mouse model with a 
deregulation of the expression of the protocadherin gene 
FATl (located several megabases centromeric to D4Z4) 
also shows a muscle pathology [123]. Also in FSHD fe- 
tuses, FATl expression seems to be disturbed but whether 
this is a direct consequence of D4Z4 contraction [119], or 
perhaps a more downstream effect remains to be estab- 
lished. The absence of consistent data that these genes are 
indeed deregulated in FSHD muscle challenges their in- 
volvement, although the FRGl mouse model was useful 
for a proof-of-concept study that knocking down of FRGl 
in these mice by viral shRNA delivery prevented or re- 
stored muscle pathology [124]. 

The role of non-coding RNAs in the regulation of the 
D4Z4 chromatin structure also needs further studies. 
While there is agreement that the D4Z4 region is broadly 
transcribed and produces both sense and anti-sense RNAs 
[29], one study found these transcripts from D4Z4 arrays 
on chromosomes 4 and 10 and from both wild-type and 
disease-associated alleles and suggested that these RNAs 
and the small RNA fragments generated from them might 
be involved in epigenetic silencing of the region [29]; 
whereas another study found a long- RNA transcript 
specifically from the contracted allele and showed that it 
had a derepressive role in the region [60]. Further studies 
will be necessary to reconcile these apparently different 
conclusions. 

Finally, variability in onset and progression is one of the 
clinical hall marks of FSHD, already noted in the first de- 
scription of FSHD, and recently further highlighted in a 
large genotype-phenotype study [125]. Some attempts 
have been made to correlate the phenotype with the re- 
sidual D4Z4 repeat size and with epigenetic changes of 
the repeat [71,126-128]. These studies show that patients 
with 1 to 3 units are typically severely affected cases while 
the disease spectrum in individuals with 4 to 10 units can 
vary from asymptomatic gene carriers to severely affected 
patients. The cause for this wide variability in clinical 
presentation is still poorly understood. A recent paper by 
Sacconi et al [76], showing that mutations in the FSHD2 
gene SMCHDl act as a disease modifier in FSHDl fam- 
ilies suggests that epigenetic variation may, at least in 
part, underlie the clinical variability of FSHD. It will be 
important to further study the role of SMCHDl and 
other epigenetic modifiers in relation to clinical vari- 
ability of FSHD. 
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Conclusions 

Although some areas of disagreement remain, there is 
increasing agreement and consensus on a model of 
FSHD pathophysiology that accounts for nearly all of the 
current experimental findings. This model is based on 
the inefficient epigenetic repression of the DUX4 retro- 
gene in the D4Z4 macrosattelite repeat adjacent to a 
polymorphic poly-adenylation sequence in the subtelo- 
meric region of chromosome 4, and results in variegated 
expression of DUX4 in skeletal muscle nuclei in FSHD. 
The genes regulated by the DUX4 transcription factor 
suggest several plausible mechanisms of disease patho- 
physiology, including apoptosis, inhibition of regeneration, 
and a primary immune response. The direct transcrip- 
tional consequences of the expression of the DUX4 tran- 
scription factor in postmitotic muscle are becoming 
known, and current efforts are focusing on translating this 
knowledge towards identification of useful biomarkers 
and understanding the pathophysiological consequences. 
Further pre-clinical and clinical studies will be necessary 
to determine whether FSHD results from a single domin- 
ant pathway modulated by DUX4, or whether it reflects 
damage from multiple different pathways. Whichever is 
the case, the development of a generally accepted and co- 
herent model for FSHD provides new avenues for thera- 
peutic development. Although no single FSHD animal 
model exists besides humans, multiple different cell and 
non-human animal models that embody distinct compo- 
nents of the pathophysiological pathways have already 
been developed and can be used for testing candidate 
therapies. In summary, we have reached a pivotal mile- 
stone in FSHD research when the field is coming to agree- 
ment on a consensus model of pathophysiology and can 
confidently use this model as the basis for therapeutic de- 
velopment. Based on the rapid advances in FSHD research 
and the new efforts at drug development, it is safe to con- 
clude that clinical trials are just around the corner. A 
major challenge for the coming years will be to put in 
place the elements needed for clinical trials, which will in- 
clude: (1) establishment and maintenance of patient regis- 
tries based on globally accepted datasets; (2) development 
and validation of patient- relevant outcome measures; and 
(3) further development and validation of biomarkers, 
both molecular and radiologic [129]. 
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